• Notes:
    1. AY2017/2018 Semester 2, AY2018/2019 Semester 2, bidding data not available.
    2. The bidding statistics are highly non-normal, likely due to being bounded by zero (they cannot bid negative points or have negative bidders). May consider using zero-inflated or poisson regression if considering these statistics as dependent variables.

Phase 1: Setting Up Environment, Packages And Loading Data.

Phase 2: Filter, Transform And Merge Data

>myModInfo

  • Filter Module Information, myModInfo.
    • Removing non-Psychology modules.
    • Removing tutorial information.

>>Filter

>myBid

  • Filter CORS Bidding Information, myBid.
    • Removing non-Psychology modules, including Roots and Wings (PLS) and Psychology for non-Psychology students (PLB).
    • Removing information from reserved modules.
    • Removing information from modules with more than one lecture/seminar session.

>>Filter

Phase 3: Data Diagnostics

  • Plot univariate histograms and bivariate plots using loops for almost every combination of variables.
  • The graphs from this section are predominantly for diagnostics rather than exploration, what I mean is that the graphs from this section would make little sense if one tried to draw insights from them. This is because they are aggregated across all other variables.
    • For example: The mean of Bidders is calculated across all academic years, all bidding rounds, all modules…
  • What I am looking out for in this section are odd patterns, like zeroes in places where they shouldn’t be, missing data, highly non-normal data, variables with outliers, etc…

Univariate Descriptive Statistics

##       AcadYear   Semester Round      ModuleCode       Quota           Bidders         LowestBid       LowestSuccessfulBid   HighestBid         StudentAcctType      DayText      StartTime   
##  2013/2014:329   1:799    1A:496   PL3232 :  74   Min.   :  1.00   Min.   :  0.00   Min.   :   0.00   Min.   :   0.0      Min.   :   0.0   New[P]      :253    Monday   :308   Min.   : 800  
##  2015/2016:293   2:904    1B:290   PL3236 :  72   1st Qu.:  4.00   1st Qu.:  0.00   1st Qu.:   0.00   1st Qu.:   0.0      1st Qu.:   0.0   NUS[P]      :246    Tuesday  :312   1st Qu.:1000  
##  2014/2015:283            1C:176   PL3233 :  71   Median : 15.00   Median :  3.00   Median :   1.00   Median :   1.0      Median : 247.0   Return[P]   :894    Wednesday:462   Median :1200  
##  2012/2013:267            2A:243   PL3235 :  71   Mean   : 22.65   Mean   : 11.79   Mean   :  64.12   Mean   : 231.7      Mean   : 689.3   ReturnNew[P]:310    Thursday :356   Mean   :1293  
##  2016/2017:205            2B:252   PL3234 :  70   3rd Qu.: 32.00   3rd Qu.:  8.00   3rd Qu.:   3.00   3rd Qu.: 101.0      3rd Qu.:1178.5                       Friday   :265   3rd Qu.:1500  
##  2011/2012:176            3A:129   PL2132 :  63   Max.   :197.00   Max.   :208.00   Max.   :2430.00   Max.   :3459.0      Max.   :4801.0                                       Max.   :1800  
##  (Other)  :150            3B:117   (Other):1282

Bivariate Plots

Continuous-Categorical

## AcadYear ~ Quota

## AcadYear ~ Bidders

## AcadYear ~ LowestBid

## AcadYear ~ LowestSuccessfulBid

## AcadYear ~ HighestBid

## AcadYear ~ StartTime

## Semester ~ Quota

## Semester ~ Bidders

## Semester ~ LowestBid

## Semester ~ LowestSuccessfulBid

## Semester ~ HighestBid

## Semester ~ StartTime

## Round ~ Quota

## Round ~ Bidders

## Round ~ LowestBid

## Round ~ LowestSuccessfulBid

## Round ~ HighestBid

## Round ~ StartTime

## StudentAcctType ~ Quota

## StudentAcctType ~ Bidders

## StudentAcctType ~ LowestBid

## StudentAcctType ~ LowestSuccessfulBid

## StudentAcctType ~ HighestBid

## StudentAcctType ~ StartTime

## DayText ~ Quota

## DayText ~ Bidders

## DayText ~ LowestBid

## DayText ~ LowestSuccessfulBid

## DayText ~ HighestBid

## DayText ~ StartTime

Phase 4: Creation Of New Variables

Relevant Statistics About Data

Number Of Iterations Per Module

Datatable